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1. Introduction 

Recently, real large-scale network data have been analyzed in various fields, 
and the corresponding random graphs have been studied. Many of these 
graphs exhibit the power-law form of the degree distribution, with the 
power-law exponent typically between 2 and 3. In view of this, physicists 
have proposed stochastic algorithms for generating growing networks. The 
underlying assumption of these growing networks is that a vertex with a certain 
fixed number of edges is added to the graph one by one at each discrete time 
step. To obtain a power law, the growth mechanism is usually supplied by 
so-called preferential attachment, which stipulates that each newly introduced 
edge is more likely to be connected to a vertex (which already exists in the 
graph) that has a larger degree. For a review of the studies on such models see 

DEI- 

However, not all real networks are growing; algorithms for generating 
nongrowing realistic graphs could be more appropriate for real situations in 
which the number of vertices does not change rapidly. In this regard, a type 
of random graph in which each of n vertices is assigned a random variable 
(weight) was proposed and its mean behavior has been analyzed EJ El EJ. 
Interestingly, the power-law degree distribution can emerge even with a weight 
distribution that is not equipped with a power-law. For example, mean-field 
analysis suggests a power-law degree distribution with scaling exponent —2 
when the vertex weights are independent and identically distributed (i.i.d.) 
random variables obeying the exponential distribution. 

Our first study is regarding such graphs. Formally, our model consists of n 
vertices labeled 1, . . . , n and an i.i.d. sequence of random variables X\, . . . , X n 
with F denoting their common distribution function. We associate the random 
variable Xj with the vertex labeled i. Given a fixed threshold value 9 > 0, we 
connect the vertices i and j by an edge if and only if X{ + Xj > 9 and 
i 7^ j. Let Gg be the random graph thus produced. A simple coupling argument 
shows that G G >} < G G e } whenever & > 9. 

Let D n (i) := #{j : G Gg}, i.e. the degree of the vertex % in the 

graph Gg. Note \D n {%) : i > 1} are identical in distribution and let D n denote 
a random variable with this common distribution. The distribution D n can be 
obtained as follows: given (n + 1) vertices, conditioned on the event X± = x, 
vertices j G {2, . . . , n+1}, will connect to the vertex 1 if and only if Xj > 9 — x. 
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Therefore, for < k < n, we have 

F(D n+1 = k) = J°° Q [1 - F{0 - x)] k [F(9 - x)] n - k F{dx). (1) 

Here F(dx) represents the probability measure on the real line representing the 
law of X\. Equation (JTJ) allows us to obtain the following asymptotic result: 

Theorem 1 As n — > oo, 

— ^l-F(6-X x ). (2) 
n 

Here ==>- stands for convergence in distribution. 

In data- analysis context, we are often concerned with the degree 
correlation between a pair of vertices |I] |2| • Here we discuss the asymptotic 
properties of the graph. Our first result is 

Theorem 2 D " ( - 1 - > and Dn ^ 2 ' are asymptotically independent. 

This asymptotic independence breaks under the condition that the vertices 
1 and 2 are connected. However, we need to be careful about this statement, 
in the sense that we will need some conditions on the distribution function F 
of the weight of a vertex. In particular we assume that 

Assumption 1: There exists u and v in the support of the distribution function 
F (i.e. for any e > 0, F{u + e) > F(u — e) and F(v + e) > F{y — e)) such that 
u < 6/2 < v and u + v > 9. 

Theorem 3 Given that the vertices 1 and 2 are connected, under the above 
assumption Dn ^> and Dn ^ are not asymptotically independent. 

The importance of the assumption can be understood by looking at the 
situation when the assumption does not hold. In that case, the vertices can 
be classified into two groups: those which have weights 8/2 or less and those 
which have weights more than 9/2. In the random graph constructed, each of 
the vertices of the former group remain isolated, whereas the vertices of the 
latter group form a complete graph among themselves. Also, by the strong law 
of large numbers, the probability that a vertex belongs to the former group 
converges to F(8/2) and the probability that a vertex belongs to the latter 
group converges to 1 — F(6/2). Moreover, given two vertices (1 and 2 (say)) 
are connected, they must belong to the latter group and then we have 



P 



lim Dn ^ = 1 - F(6/2), lim Rl^l = i _ F(6/2)\ vertices 1 and 2 are connected 

(woo n rwoo n 



p 



lim — ^— - = 1 — F(6/2) [vertices 1 and 2 are connected 



n— >oo n 



(3) 
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Thus we obtain conditional asymptotic independence. 

A remarkable characteristic of real graphs is the clustering property PjJ |2] . 
The clustering property means an abundance of connected triangles in the 
random graph. The threshold model exhibits such a clustering property as 
shown exactly in El • 

To formalize this, let h : 1R 3 — > R be given by 



h(x 1 ,x 2 ,x 3 ) 



*-{xi+X2>0,X2+X3>9,X3+xi>6 



(4) 



Also let 
T 



F 3 (9) 



Ci(F) 



#{{i,j,k) : 1 <i < j < k <n, 

Xi + Xj > 9, Xj + Xk > 9, Xk + Xi > # j , 
E[h(X 1 ,X 2 ,X 3 )] 

= F(dxi)F(dx 2 )F(dx 3 )h(xi, x 2 , x 3 

Jr Jr Jr 

= P(Xx + X 2 > 9,X 2 + X 3 > 9,X 3 + X l > 

/ / F(dx 2 )F(dx 3 )h(X 1 ,x 2 ,x 3 
Jr Jr 



(5) 



E 



(F 3 (9)) 2 >0. 



(6) 
(7) 



Note that T n counts all triangles in the graph with n vertices. 
The asymptotic results are 



Theorem 4 As n 

T 



(a) 



G) 



OO, 

F 3 (9) almost surely; 



r T 



( b ) Vn~ 7^- F z(°) VKi{F)Z where Z is a standard normal random 
variable. 

The method used to show the above results may be generalized easily to obtain 
a count of not only triangles but subgraphs in Gg isomorphic to a fixed graph. 
In data-analysis contexts, a fixed subgraph is called a motif of the graph. 
Depending on the types of real networks (e.g. Internet, gene networks, neural 
networks, social networks), there are some sorts of small motifs that appear in 
an entire graph significantly more than in the random graphs. These motifs 
are relevant to functional roles such as signal transduction and information 
processing apposite to each application jBJEl- Our results, Theorems |H1 and M 
in Section 4, obtain limit theorems for the motifs of the threshold model. 
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Besides the extension to general graphs, the above theorem may be 
extended for local triangle counts, i.e., the number of triangles containing a 
specified vertex. The clustering coefficient, which is a quantity often used to 
evaluate the degree of clustering property, is defined as the normalized number 
of locally counted triangles averaged over all the vertices. Our local results 
below show that the vertex-wise clustering coefficient satisfies the central limit 
theorem, which validates the use of expectation in physics community. 

We fix the vertex 1 and define T n (l) as the number of triangles in the set 
of vertices {1,2, ... ,n, n + 1} of which the vertex 1 is a site. In other words, 

T n (l) := #{2 < i <j < n+ 1 : h(Xi,Xi,Xj) = 1}. (8) 

Theorem 5 As n — > oo, 

r„(i) 




, , F(dxi)F(dx 2 )h(X,x 1 ,x 2 ) (9) 

where X is an independent random variable identical in distribution to X\ . 

Besides the above dimensionless random graph model, a spatial model 
has been proposed in Consider a homogeneous Poisson point process of 
intensity A on W 1 . We insist that the origin, 0, is a point of the process. Let 
{0 = £o> iii £2, • • • } be an enumeration of the points of the process. Associated 
with each point £j is a random variable Xj. We assume that {X , X\, X 2 , . . . } 
is an i.i.d. sequence of random variables with common distribution function F. 
The random graph Go,p is obtained by connecting and ^ by an edge if and 
only if (Xi + Xj) > 9\£i — ^jl 13 where 6 and (3, the parameters of the model, are 
real numbers. 

Define the degree of the origin in a sphere of radius r as 

A r := #{z > 1 : (X + X { ) > ^/and |&| < r}. (10) 
Given a fixed ieK, let 

f(r;x) = f(\r\;x) := 1 - F(6\r\ p - x). (11) 

Define 

C r (x) := / r d - 1 f(r;x)df, (12) 







and consider the two following cases: 
(i) As r — > 00, C r (x) — > C{x) := J °° f d_1 /(f, x)df < 00 for every x G M. 
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(ii) There exists a sequence {C r } and a function g(x) such that C r — > oo and 



g(x) as r — ► oo for every x G 



For the first case, we have 



Theorem 6 If C r (x) — > C(x) < oo /or every x G IR as r oo, £/ien we nave 

A r A (13) 

where the characteristic function of the random variable A is given by 4>&{t) = 
L F(dx) exp(— \cdC(x)(l — exp(zi))) where d represents the volume of the 
id — 1) - dimensional unit sphere. 

Remark The random variable A represents the degree of the origin in the 
random graph. 

For the second case, we have 

Theorem 7 Suppose that there exists a sequence {C r } such that C r — > oo and 
g(x) as r — > oo /or every s£l. We have 

X -' VA ''- > Z + y^c~ d g(X ) as r ^ oo (14) 



where Cd is as defined in Theorem^ 

We end this section with an example of F satisfying the condition in Theorem 
□ Fix (3 = 1 and d = 2. Define F : [0, oo) -> [0, 1] by 

F(x) := 1 - Cx- a (15) 

where < a < 2 and C > 0, and 

C r :=C6- a r 2 ~ a /{2-a). (16) 

Simple computations can be carried out to verify that the conditions of 
Theorem [7| are satisfied in this case with g(x) = for all x G R. In the 
next few sections we prove our results. 



2. Degree D n of a vertex 

To prove Theorem ^ observe that, since (n + l)/n — > 1, it is enough to show 
that D n+ i/n converges weakly to the required random variable. For £ G M., we 
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E exp 



itD n -\ 



E( itk 
exp — 
\ n 



n 



[1 - F{6 - x)] k [F(6 - x)] n - k F(dx) 



00 k=0 

00 



H exp f — ) [1 - F(f) - .r)] k [F(0 - .,■)}"-'•■ FUl.r) 



-00 
00 



-00 
00 



-00 
00 



it 



F(dx) 



1 - F{6 - x)) exp ( - J + F(0-z) 

+ F(0-x) 

exp (it (1 — F{6 — x)))F(dx) as n — > 00, 



;i-f(^-x)) ( i + - + ( I 



l + -(l-F(0-a;)) + o( - 
n \ n 



where the limit follows from dominated convergence theorem because 

., \ 2 

it 



;i7) 



1 - F(9 - x)) exp f - j + F(0 - ar) 

= [(1 - F(0 - x)) cos(*/n) + F(0 - x)} 2 + (1 - F(0 - x)f sin 2 (t/n) 
= (1 - F{6 - x)f + 2F(0 - x)(l - F(0 - x)) cos(t/n) + (F(6 - x)f 
< I- (18) 
This proves Theorem ^ 

3. Pair correlation 

Now suppose there are n + 2 vertices labeled 1, 2, . . . n, n + 1, n + 2 with 
corresponding random variables Xi, . . . , X n+ 2- Define 

Am := D n+2 (n + 1) = #{j : 1 < j < n + 2,j ^ n + 1, X, + X n+1 > 0} (19) 
A,2 := D n+2 (n + 2) = #{j : 1 < j < n + 2,j ^ n + 2,X, + X n+2 > 0}.(2O) 
We have, for < k, I < n + 1, 



P [D nA = k, D n>2 = I] = P [D nA = I, D n>2 = k] . 



(21) 
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For k > I 
P [D n>1 = k, D n>2 = I] 

/oo poo 
/ l {a>b} F(da)F(db)F [#{1 < j < n : a + X j > 9} + l {a+b>e} = k, 
-oo J —oo 

#{l<j<n:b + Xj>e} + l {a+b>e} = I] 

/■CO POO 
/ l {a>6} l {a+6>(?} F(rfa)F(rf6)P [#{1 < j < n : a + X j > 9} = k - 1, 
-oo J -co 

#{l<j<n:b + X j >9} = l-l] 

/oo /»oo 
/ l {a >6 } l {a+ 6< e} F(rfa)F(rf6)P [#{1 < j < n : a + Xj > 9} = k, 
■oo J —oo 

#{1 < j < n : b + Xj > 9} = 1} 

/oo poo 
/ l{ a >6}l{o+6>fl}ffn(0; a,b;k-l,l- l)F(da)F(db) 
oo J — oo 

/oo /»oo 
/ l {a>6} l {a+ 6< e} ^ t (^;a,6;A;,/)F(rfa)F(rf6), (22) 
■oo J — oo 

where, for a > b, 
g n (9;a,b;k,l) 

■ = P [#{1 < j < n : a + X, > 9} = k, #{1 < j < n : b + Xj > 9} = I] 
= P [#{1 < j < n : Xj > 9 - a} = k, #{1 < j < n : Xj > 9 - b} = I] 
= P [#{1 < j < n : Xj > 9 - b} = Z, #{1 < j < n : 9 - a < Xj < 9 - b} = k - I] 

otherwise. 

(23) 

For = / similar calculations show 
P [D n>1 = k, D n>2 = k] 

/OO pOO 
/ l{a>b}l{a+b>o}gn(d] a, b; k - 1, k - l)F(da)F(db) 
-oo J — oo 

/oo poo 
I l{a>b}l{a+b<9}9n{0; a, b; k, k)F(da)F(db) 
oo J — oo 

/•OO poo 
/ l{a<b}l{a+b>e}9n(8] b,a;k-l,k- l)F(da)F(db) 
-oo J — oo 

/•oo /»oo 
/ l {a < 6} l {o+6 <«}^„(0;6,a;A;,A;)F(da)F(d6). (24) 
-oo J — oo 



-CO J —oo 

Now let 



d n (l) := #{j : 1 < j < n, X, + X n+1 > 9} (25) 
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and 

d n {2) : = #{j : 1 < j < n, X j + X n+2 > 6}; (26) 

then 

> 4(1) > D nA - 1 (27) 

and 

A>,2 > d n {2) > D n>2 - 1. (28) 

Thus the asymptotic distributions of and ^) are 

identical, and we work out pair correlation with d n (l) and rf n (2) instead of 
D n> i and D n>2 . 

Now, for all < fc, / < n, 

P [d n (l) = k, d n (2) = l]=F [d n (l) = /, d n (2) = k] , (29) 

and, for k > I 



/oo />oo 
/ l{a>6}»n(0; a, 6; fc, l)F(da)F(db), (30) 
-oo J —oo 



while, for A; = / 

/oo ;>oo 
/ l {a > b }9n(d;a,b;k,k)F(da)F(db) 
-oo J — oo 



-oo J —oo 
POO poo 



/oo poo 
/ l {a <b}gn{e-b,a-k,k)F{da)F{db) 
-oo J —oo 

l{a>b}g n (0; a, b; k, k)F{da)F(db) 



oo J —oo 
oo poo 



— oo </ — oo 
oo />oo 



l {a =6}Sn(0; a, 6; fc, k)F{da)F{db). (31) 



— oo ^ — oo 



Now let — oo < s,t < oo. To derive pair independence, let us consider the 
characteristic function. We have 

n n 

E [exp(isd n (l) + ^d n (2))] = exp(is£; + i*Z)P [d n (l) = k, d n {2) = I] . 

k=0 1=0 

(32) 

We break the above double sum into three parts, when (i) k > I, (ii) k < I and 
(iii) k — I. 
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exp(isk + itl)W> [d n (l) = k,d n (2) = I] 

k,l:0<Kk<n 
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exp(isk + itl) 



k,l:0<l<k<n 



^oo POO 



±{a>b}g n (0; a, b; k, l)F(da)F(db) 



oo J — oo 



/oo poo 
/ \ {a>b} F(da)F(db) Yl exp(isk + iti)g n (8;a,b;k,l). (33) 



k,l:0<l<k<n 



The term 



exp(isk + itl)g n (9;a,b;k,l) 

k,l:0<Kk<n 



= exp(is/c + it/) 



71! 



k,i.0<l<k<n 



l\(k - l)\(n - k)\ 



x 



■ If -| fc — Z r -| n— k 

l-F{9- b) F{6 -b)- F{6 - a) F{9 - a)\ 



k,l:0<l<k<n 



^ k<n eM ' S{k + t)] W^l)Kn ~ k)\ 

x [l - F(9 - b) 1 F(9 -b)- F(0 - a) F{6 - a) 
exp(is(k - I) + il(s + t)) 



n—k 



k,l:0<l=k<n 



l\(k - l)\(n - k)\ 



x [l - F{6 - b) 1 F{9 -b)-F{d-a) * ' F{6 - a 



n—k 



exp(i(s + *)) (1 - F(9 - b)) + exp(is) (F(0 - b) - F(9 - a)) + F(9 - a) 



exp(i/c(s + t)) 



n! 



k=0 



k\{n-k)\ 



\l-F{9-b)] k [F{9-a)} 



n—k 



exp(i(s + t)) (1 - F{9 - b)) + exp(is) (F(9 - b) - F(9 - a)) + F{9 - a 

-i n 

exp(i(s + t))(l-F(9-b))+F(9-a) . (34) 
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Thus 

exp(isk + itl)W> [d n (l) = k,d n (2) = I] 

k,l:0<Kk<n 

oo poo 



OO J — OO 



l {a>b} F(da)F(db) 

([exp(i(s + t))(l - F{0 - b)) + exp{is)(F{6 - b) - F{6 - a)) + F{9 - a)] n 
- [exp(i(s + t))(l-F(6-b)) + F(6-a)] n y (35) 
Similarly, for (ii), 



^2 cxp(isk + itl)F[d n {l) = k,d n {2) = l] 

k,l:0<k<l<n 

"OO POO 

/ l {a>b} F(da)F(db) 

— oo J — oo 



([exp(i(s + f))(l - F(0 - b)) + exp(it)(F(6 - b) - F(6 - a)) + F(9 - a)]™ 
- [exp(*(s + t))(l-F(#-&)) + F(#-a)] n ). (36) 

For (iii), 

exp(isA; + itk)¥ [d n (l) = k, d n {2) = k] 

k,l:0<k=l<n 

noo 
l {a>b} F(da)F(db) 
■oo 

(£exp(w* + ^) fc!(w n l fc) , (l " F {0 ~ b))\F{d - a)f- k ) 

/OO POO 
/ l {a=fe} F(rfa)F(rf6) 
■oo J — oo 

(g exp(. S A; + »tfc) fc , (w W l fc) , (l " ^ " - a))""*) 

/oo /-OO 
/ l {a>b} F(da)F(db) [(1 - F(0 - b)) exp(*(s + *)) + F(9 - a)] n 
oo J — OO 

/•oo /»oo 
/ l {a=6} F(rfa)F(rf6) [(1 - F(0 - b)) exp(i(s + t)) + F(9 - a$?) 
-oo J —oo 
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Combining the above and taking s = u/n and t = v/n, we have 
E [exp(w(4(l)/n) + iv(d n (2)/n))] 

l {a>b} F{da)F{db) 

[1 - F{6 - b)) + exp (f ) (F(9 - b) - F(0 - a)) + F{6 - a) 



oo poo 



oo ./ — oo 

i(u+i)) 



cxp 



(3* 



+ 
+ 



exp 

OO POO 



i(u+v) 



[I - F{9 - b)) + exp (|) - 6) - F(0 - a)) + F(0 - 



l {a=b} F(da)F(d6) 



oo J — oo 

OO /"OO 



— oo «/ — oo 



l {a>6} F(da)F(oto) 
iu 0(1 



•-xDl^J (1 — F(0 — b)) + F{9 — a) 
i(u + v) 0(1)' 



1 + 



+ 



n 



n- 



+ 1 



n 



rr 



(F(6 -b)- F(6 - a)) + F(6 - a) 



+ 



1 + !C»±£) + oy)\ 

n rtr J 



! 1 + - + ^ ) (F(6-b) -F(6-a)) + F(6-a) 



OO /"OO 



/ / l {a=6} F(da)F(d&) 

' — OO J — CO 

Letting n — > oo, we see that 



1 + 



i(u + v) 0(1) 



rr 



[1 - F(0 - &)) + F(0 - 4J9) 



lim E [exp(w(d n (l)/n) + zv(c? n (2)/n))] 

n— >oo 

/oo /*oo 
/ \ {a>b] F{da)F{db) {exp + v)(l - F(0 - 6)) + iu{F{6 - b) - F{6 
-oo J —oo 

+ exp[i{u + v){l -F(6-b)) + iv{F{6-b) - F{6 - a))}} 

/OO POO 
/ l {a=b} F(da)F(db) exp + v)(l - F(0 - b))] 
oo J —oo 

/oo 
exp(m(l - F(6 - a)))F(da). 
-oo 

Thus combining with the result in Section 2 we see that 
lim E [exp(iu(d n (l) / n) + iv(d n (2)/n))] 

n^oo 

= E [exp(iu(l - F(6 - X 1 )))} E [exp(zv(l - F{6 - X 1 )))\ 

= lim E[exp(zu(-D n (l)/n))] lim E[exp(zv(D n (2)/n))] , (41) 

n— >oo n— >oo 

i.e. we obtain the asymptotic independence as enunciated in Theorem |21 

Now we obtain Theorem El The joint conditional probability distribution 
of the weights of two fixed vertices, provided that these vertices are connected 



0)1 



(40) 
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by an edge, is given by 

H(da, db) = 1 ( a + ^ > e A F (da)F(db) (42) 
ap(v) 

where ap{6) := $™ l(a + b > 6)F(dd)F(db) is the normalizing constant. 
It is actually the probability that two vertices share an edge. Consequently, 
the conditional probability distribution of weight of a vertex provided that it 
shares an edge with another vertex is given by 

/oo 
H(da, db) 
-oo 

jr 0O l(a + b>9)F(db) 

= F{d<X) M9) 

1 — F(9 — a) , . . 
= jn\ — -F(da). 43 

In order to study the asymptotic dependence of d n {l)/n and d n (2)/n, we 
need to consider the difference between the expressions in the left and right 
sides of equation (JJUj). Using equations (j4^|) and the asymptotic limit of 
this difference is 

lim E [exp(iu(d n (l) /n) +iv(d n (2)/n))] 

n— »oo 

lim E[exp(iu(£> n (l)/n))] lim E [exp(iv(D n (2) /n))\ 



n— >oo 

oo 



(H(da, db) - G{da)G{db)) e ^-no-a)) e H\-F(fi-b)) _ ^ 

If the resulting conditional random variables are asymptotically independent, 
then for all u, v 6 R, the right hand side of equation (}4*4"|) must vanish. Now 
we claim that this will imply that the probability measures H(da, db) and 
G(da)G(db) on M 2 are identical. Indeed, let (M l5 M 2 ) and (Ri,R 2 ) be two 
random vectors on M? whose distributions are given by the probability measures 
H(da,db) and G(da)G(db), respectively. Consider the map / from M 2 — > M 2 
defined by / : (a, b) — > (1 — F(0 — a), 1 — F(0 — 6)). Then, the supposition that 
equation ()44|) vanishes implies that the characteristic function of /(M l5 M 2 ) is 
same as that of f(R±,R2). Hence their distributions are also same. Therefore, 

F[M l <(3 1 ,M 2 <(3 2 } 
= P [1 - F{9 - M x ) < 1 - F(9 - Pi), 1 - F(9 - M 2 ) <1-F(9- (3 2 )] 
= F[1-F(6-R 1 )<1- F{6 - pi), l-F(d-R 2 )<\- F{6 - p 2 )} 
= F[Ri<Pi,R 2 <p 2 }; (45) 
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i.e. H(da,db) and G(da)G(db) are identical. 

Now, we claim that, because of Assumption 1 (of Section 1), the 
probability measures H(da, db) and G(da)G(db) cannot be the same. Indeed, if 
the measures H(da, db) and G(da)G(db) are same, then for any subset A C M 2 , 
J A H(da,db) = f A G(da)G(db). Using the equations (jUJ) and (jSJ), we have 

~l(a + b>6) 1 -F{6-a) 1 -F(6-b) 



a F (8) 



a F (8) 



a F {6) 
This will imply that 

l(a + b > 6)a F (6) - (1 - F(0 - a))(l - F(0 - b)) 



F(da)F{db) = 0. (46) 







(47) 



F x F almost surely. 

Now fix u and v as in Assumption 1 and let e be such that u + v — 4e > 9 
and u + 4e<6/2<v — 4e. Let a, 6 G (it — e, u + e] . Then 



l(a + 6 > = 0. 



(48) 



However, ^ — a < 6 — u + e < v — 3e < v — e, so 1 — F(0 — a) > 1 — F(y — e) 
> F(v + e) - F(u - e) > 0. Similarly, l-F(6-b)> 0. Thus 

(1 -F(6-a))(l -F{9-b)) > 0. (49) 

Hence, on a set of probability at least (F(v + e) — F(v — e)) 2 > we have 

l(a + b > 6)a F {6) - (1 - F{6 - a))(l - F(0 - 6)) < 0. (50) 

This contradiction completes the proof. 



4. Triangles 

Now we study the number of triangles in the graph. The number of triangles T n 
can be represented as a U-statistic (see JI]). The kernel function h : M. 3 — ► R 
defined in equation (|IJ) is clearly a symmetric function of Xi, a?2, ^3- Then, we 
have 




where C := {(i,j,k) : 1 < % < j < k < n} is the collection of all possible 
triplets and U n := j^Yl(i,j,k)eC Xj, X^) is the U-statistic obtained from 
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the kernel h. Theorem E(a) can be easily derived from Theorem A of ^T] 
(1980), page 190. 

Next define hi : R — > R as follows: 

h x {x) :=E[h(x,X 2 ,X 3 )] 

= P [X 2 > e - x, X 3 > 9 - x, X 2 + X 3 > 9} 

= F [mm{X 2 , X 3 }>9~ x, X 2 + X 3 > 9] . (52) 
Noting that 

E[h 1 (X 1 )]=E[E[h(X 1 ,X 2 ,X 3 )\X 1 ]}=E[h(X 1 ,X 2 ,X 3 )} = F 3 (e), (53) 
we have 

Ci(F) = Var(/i!pO) > (54) 

unless F is degenerate. The asymptotic normality of T n j (Z) , i.e. TheoremHf b) 
follows from Theorem A of ^1] (1980), page 192. 

As an aside, we note that the method of U-statistics employed above is 
more versatile. It may be applied to configurations involving any subgraph 
composed of finitely many vertices. 

Let G be a graph on k vertices (k < n) and suppose V := {v\, . . . ,Vk} 
and E be the vertex and edge sets of the graph, respectively. For a permuta- 
tion cr : V — > V, let us consider the graph G CT with vertex set V and edge set 
E a := {(a(vi),a(Vj)) : (vi,Vj) G E}. Let I := #{cr : E CT = E}. Thus I counts 
the number of symmetries of G. To illustrate this, consider the graph with 
vertex set as V = {1,2,3,4} and edge set E = {(1, 2) , (2, 3) , (3, 4) , (4, 1)}. 
For this graph I = 8; indeed any of the eight permutations 
(1, 2, 3, 4), (2, 3, 4, 1), (3, 4, 1, 2), (4, 1, 2, 3), (1, 4, 3, 2), (4, 3, 2, 1), (3, 2, 1, 4) and 
(2, 1, 4, 3) result in a graph which is equivalent to the original graph. 

Now given a graph G on the vertex set V :— {1, 2, . . . , n} with edge set E 
consider the subgraph G(i\, . . . , on the vertex set V(i\, ...,«&) = {ii, ... , ik} 
and edge set E(ii, . . . , i k ) = {(i t ,i s ) : (i t , i s ) 6 E and 1 < i ^ s < k}. Let 
Lfc be the set of all ordered fc-tuples from {1, 2, ... , n} and / : — > {0, 1} be 
defined as follows: 

,,. . . \ fl if (i s Jt) e E(i 1 ,i 2 ,...,i k ) for all (v a ,v t ) e E 
J{ii,i2,---,ik) = < , (55) 

[ otherwise. 

Thus, the number of subgraphs in G isomorphic to G is given by Z _1 T ra (G), 
where 

T n (G) := /(ii,«2, • • • ,ifc). (56) 

(ii ,12, •••,ife) SLj; 
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For the graph G$, the vertices i s and i t are connected by an edge if and 
only if X is +X it > 9. Thus the function / in the definition ()56|) may be replaced 
by the kernel function h : M. k — > {0, 1} defined by 

f 1 if X + X u > 9 for all (v s , v t ) E E 
h(X h ,X i2 ,...,X ik ) = { / . K ' (57) 

[0 otherwise, 

and we have 

T„(G)= ^ MX,,^,...,^). (58) 

(ii,i2,...,ifc)6l,fc 

This function /i need not be symmetric in its argument. Even if it were so, since 
the sum is over all ordered fc-tuples (ii, . . . ,ik) and not on A;-tuples (zi, . . . ,ik) 
such that ii < i 2 <■ ■ ■ < ik, T n need not be a U-statistic. 

To overcome these, we first define a symmetrized version of the kernel h. 

Let, 

h sym (xi,x 2 , . . . ,x k ) := Tj ^2 K x Hi)> x °-C2)> ■ ■ ■ > x Hk)) ( 59 ) 

where a : {xi, x 2 , ■ ■ ■ , Xk} — > {xi,x 2 , . . . ,x^} is a permutation, and P(k) is 
collection of all such permutations. We note that, even if x\ = x 2 , we take 
(x 2 , xi, X3, . . . , Xk) to be a permutation of (x±, x 2 , £3, . . . , Xk)- Clearly, h sym is 
a symmetric function of xi, x 2 , ■ ■ ■ , x^, and we obtain 

-^n(G) ^ hix^j Xi 2 , . . . , Xi k ) ^ ^ h S y ra ( K Xi 1 , Xj 2 , . . . , ) . 

(ii,i 2 v,«fc)eLfc (n,i2,---,«fe)eLfc 

(60) 

The statistic T„(G) is not a U-statistic. Thus we consider the statistic 



i 1= li 2 =l «fc=l 

Tn "I - ^ ^ h S y m (Xj x , Xj 2 , • • • , X i h ) 

l<il,ia, ■■■,ik<n 
not all distinct 

= T n (G)+i? n (G) (say). (61) 

Since T' n {&) is a von Mises' statistic (see [10] . page 39), the asymptotic results 
about T^(G) can be read off from the results about the von Mises' statistics 
with the kernel function h syra . To relate the statistics T n (G) and T' n {&) we 
observe that the number of terms in the sum defining R n is of the order of 
n k ~ 1 ; thus noting that h sym < 1, we have R n = 0(n k ~ l ) as n — > oo. Let 

F(G) :=E[/i(Xi,X 2 ,...,X fc )]. (62) 
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Then, by the i.i.d. nature of {Xj : i > 1}, we have E [h aym (Xi, X 2 , ■ ■ ■ , X k )} 
F(G). 

Theorem 8 As n — > oo ; 

T n (G) 



rr 



F(G) almost surely. 



(63) 



Proof: From Theorem 3.3.1 of |lUj . page 102, we have — > F(G) almost 
surely. Our observation that R n = 0(n fe_1 ) as n — > oo completes the proof of 
the theorem. 

To obtain the central limit theorem, as in (J52)l let 



hi(x) : = E [h sym (x, X 2 , X 3 ,...,X k 



(64) 



and Ci(G) = Var(/ii(X)) where X is an independent random variable identical 
in distribution to X\. Then, from Theorem 4.2.5 (^U|, page 134) we have the 
central limit theorem for T^(G). Now R n (G)/n k — > in probability as n — > oo. 
Thus we obtain 



Theorem 9 As n — > oo ; 



T n (G) 



ir 



F(G) => v/fcCi(G)Z. 



(65) 



5. Local Properties 

In this section we study T n (l) as defined in equation (JSJ). 

For fixed x G R, the kernel h(x,xi,X2) as defined in equation (J1J) is a 
symmetric function of xi and £2. Define a U-statistic based on the kernel 
h(x, ■) by 

T n (l;x):= ^ hfaX^Xj). (66) 

2<i^j<n+l 

We have by the strong law for U-statistics (Theorem A, JT] page 190) 



T n (l; x) 

-> E [/t(x, Xj, X,)] almost surely, as n — > 00. 







(67) 



The random variable T n (l; x) may be easily identified as the number of triangles 
of Go with a fixed vertex 1 and X\ = x. Formally, we may write, for any t 6 R 



E 



ex P I rt-Tsy 



F(dx)E 



ex P ( **-aTa 



X! = X 



F(dx) exp (itE [h(x, X 2 , X 3 )]) 



(68) 
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as n — > oo, because T n (l;x)/(") — > E [h(x, X 2 , X 3 )] almost surely and hence 
also in distribution. The limit is justified by the usual application of dominated 
convergence theorem since the integrand, being a characteristic function, is 
bounded by 1. The right hand side of ()68|) is the characteristic function of the 
random variable L L F(dx\)F(dx2)h{X, x\, a^)- This proves the Theorem El 

As in the previous section we may generalize Theorem El for the subgraph 
G defined earlier. 

6. Spatial model 

Consider the Poisson spatial model as elaborated in Section 1. We first thin 
the underlying Poisson process. Fix x G R. For i > 1, each point £j of 
the original process is included in the thinned process with a probability 
independently of the other points. The thinned process is an 
inhomogeneous Poisson process with intensity function g(y) for y G M given 
by g(y) = Xf(\y\;x), where f(r;x) is as defined in equation (fTTj) . For A r as 
defined in equation ((TQJ) , we have 

Proposition 1 The conditional distribution of A r given that X = x, is 
Poisson with parameter X r (x) where 

X r (x) ■= X f(r;x)df. (69) 

J\f\<r 

We first prove Theorem |H1 where C r (x) — > C(x) < 00 (see Section 1 for 
the relevant definitions). Because t — > L F(dx) exp(— XcdC(x)(l — exp(it))) is 
indeed a characteristic function, it is enough to prove that the characteristic 
function of A r converges to the above quantity. By Proposition [U the 
conditional distribution of A r . given X = x is Poisson with parameter 
f\r\< r Xf(f,x)dr = Xcd f d_1 [l — F{Qf-P — x)]dr = XcdC r (x) where Cd is the 
volume of the (d — l)-dimensional unit sphere. Therefore, we have 

4> Ar (t) := E[exp(itA r )] 

= / F(dx)E [exp(itA r )\X = x] 
Jr 

= / F(dx) exp (-AcC r (aO(l-exp(i*))). (70) 
Jr 

Now, since C r (x) — > C(x), the usual dominated convergence theorem assures 



4>A r (t) — > 4>A(t) as r — > 00. 



(71) 
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Remark Assuming 9 > 0, we may re- write C(x) in the following way: 
r *-i[l - F(9r - x)]dr 

/oo 
l(x > Or 13 - x)F(dx)dr 
-oo 

r d - x l(r < max{0, p + x)/6} 1/ ^})drF(dx) 
[max{0, [(x + x)/#] d//3 }]F(cS) 
[max{0, (x + x) d/f3 }]F(dx) 
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OO POO 



oo JO 

oo 



d6 d IP 

= ^E[max{0,(X + x)^}] 
Thus C(x) < oo if E[|X | d//3 ] < cx). 



(72) 



To show Theorem [?l it is enough to prove that the characteristic function 
of (A r — XcdC r ) I V XcdC r converges to the product of the characteristic functions 
of a standard normal random variable and a/Aq^Xq). 



E 



A r — \CdC r 

exp it- 



a/ \CdC r 



F(dx)E 



exp it 



, A r - Ac d C r 



VXCdPr 



F(dx) exp(—it^/ AqCV)E 



X = x 



exp 2 



X = x 



/ F(dx) exp ( —it a/ XcdC r — XcdC r (x)(l — exp (it/ a/XqCV)) ) 



(73) 



using the fact that the conditional distribution of A r given X Q = x follows a 
Poisson distribution with parameter XcdC r (x). Consider the logarithm of the 
integrand: 

—it a/ Xc d C r — XcdC r (x) 1 — exp | - 



- it a/ AqCV - AqCV(x^ 



it\/ \c n 



P \y/Xc d C r 

it 1 f it 
VXc d C r 2 V VXc d C, 

t 2 C n (x) / 1 

^—^ Ac d C r (x) xo — 



+ o 



( y/\c d Cr 



(74) 
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Under our condition, the first term converges to it^/\cd,g(x). The condition 
also implies that C r (x)/C r — > 1 as r — > oo, and thus the second term converges 
to — 1 2 /2. The third term can be written as [C r (x)/C r ] x [C r o(l/C r )\ — > as 
r — > oo. Applying the dominated convergence theorem, we now obtain that 



E 



exp it 



A r — \c<jC r 



\f\CdPr 



exp J F(dx)exp(ity/Xfyg(x)) 



(75) 



Now note that exp(— 1 2 /2) is the characteristic function of a standard normal 
random variable and j R F(dx) exp (it V Xcdg{x)) is the characteristic function of 
y/Xcdg(Xo)- This completes the proof of Theorem [7| 
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